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Abstract 

Predicting the popularity of items in rating net'works is an interesting but challenging 
problem. This is especially so "when an item has first appeared and has received very fe'w 
ratings. In this paper, -we propose a novel approach to predicting the future popularity 
of new items in rating networks, defining a new bipartite clustering coefficient to predict 
the popularity of movies and stories in the MovieLens and Digg networks respectively. 
We show that the clustering behaviour of the first user who rates a new item gives insight 
into the future popularity of that item. Our method predicts, with a success rate of over 
65% for the MovieLens network and over 50% for the Digg network, the future popularity 
of an item. This is a major improvement on current results. 

Keywords: Bipartite networks. Clustering coefficient. Rating networks. Popularity 
prediction. Ego networks 


1. Introduction 


Websites like Amazon, TripAdvisor and MovieLens offer their users a means to rate 
a variety of different items. Users can decide whether they are interested in an item 
based on its previously received ratings. Websites collect these user ratings for many 
reasons including recommending items to their users and predicting future item ratings 


(Resnick et al. 1994). The latter is rather challenging. In particular, new items that have 
received very few ratings to date are hard to classify as being popular or unpopular in 


the future, due to the sparsity of information (Schafer et al., 2007). It has been suggested 


that a ranking of the users may aid in improving predictions of future item popularity 
(Zeng et al., 2013). In other words, the behaviour of some users may be adopted by 


others. In this paper, we predict the future popularity of new items by investigating 
the ego network of the user who first rated the new item. We use the ego’s clustering 
behaviour to predict the number of ratings and the average rating the new item is likely 
to receive in the near future. We demonstrate our approach on the MovieLens network 


(GoupLens 

2014 

) that contains ratings of 10,681 movies by 71,567 different users and the 

Digg network ( 

KONECT 

2014 

) that contains ratings of 3,553 stories by 139,409 users. 


over 50% of the time we are able to predict the future popularity of new stories compared 


to a 30% and 20% prediction success respectively obtained by the current method (Zeng 


et al. 2013). 






























Ranking the nodes of a network according to their importance or influence level is of 


high interest in many different areas and many ranking methods have been proposed (Aral 


& Walker 

2012 

Liu et al. 

2013 

Ren et al. 

2014 

Wei et al. 

2013 

Zhang et al. 

2013 


A straightforward approach is to consider the node degrees. However, degree central¬ 
ity is often unsuitable for identifying influential nodes, as a node with a few important 
neighbours may be more important than a node with a large number of unimportant 
neighbours (Li et al. 2014). Google’s PageRank (Page et al., 1999) takes this notion into 
account. This was improved by Lii et al. (2011), who proposed a parameter-free algo¬ 
rithm called Leader Rank that performs better than PageRank in many aspects. Another 
approach that uses k-shell decomposition and shows that the most important nodes of a 
network lie inside its core (Kitsak et al. 2010), was outperformed by a mixed degree de¬ 
composition proce dure (|Zeng fc Zhang 2013) that considers the residual and exhausted 
degree of a node. Chen et al. (2013b) point out that it is of advantage to design algo¬ 
rithms that are based on local information since networks are becoming larger. They 
propose a method that for the first time takes path diversity into account. Path diversity 
is very important in the identification of influential spreaders, since a node with many 
overlapping propagation paths may not spread, information for example, very efficiently. 

All of the above approaches to finding influential nodes are designed for one-mode 
networks. In rating systems, on the other hand, users are not directly connected as all 
interactions take place between users and items. These systems are best represented by 
dynamic bipartite networks (Zhou et al. 2007). The nodes of a bipartite network can be 
partitioned into two disjoint sets, the primary and secondary node sets, such that edges 


only connect nodes from different sets (Asratian et al. 
primary nodes, the users, are denoted by Ui, whereas secondary nodes, the items 
denoted by Wj. 


1998[ ). Throughout this paper, 

are 


Zeng et al. (2013) predict the popularity of items in three different bipartite networks. 


They define popularity by the increase in degree, meaning that an item with a high 
increase in ratings is considered as popular. It is reasonable to assume that popular 
items are rated more frequently, but there may be exceptions. Some items may receive 
a relatively high number of low ratings and consequently should not be considered as 



some time, with a success rate of 72% for MovieLens, the fraction of new items correctly 
identified as popular in the future is small, approximately 30%. The success rate for new 
items in the Digg network was 20% and could be increased to 60% by using the friendship 
network of the Digg users that is also available. Since for most bipartite rating networks, 
the underlying friendship network of users is not available we will not consider it in this 
study. 

The main focus of this paper is to improve the prediction rate of new items. To do so, 
we propose a more sophisticated method, using the clustering coefficient. The clustering 


coefficient has been used in one-mode networks to identify influential nodes (Chen et al. 


2013a). These influential nodes were found to spread information more quickly than 


those found via PageRank or LeaderRank. In their paper Chen et al. (2013a) investigate 
whether a node with lower clustering attracts more connections than a node with higher 
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clustering. Their results clearly show that a node with a small clustering coefficient will 
build more connections in the future. Instead of focusing on the spreading abilities of 
nodes (users), our method improves predictions, especially of those items that are new 
in the system. Our approach only considers network topology and is suitable for data 
such as MovieLens, which does not record any particulars of the users, such as age or 
gender. For our method to work, the knowledge of single ratings, whether high or low, 
is also unnecessary. 

The rest of this paper is organised as follows: In Section]^ we define a new clustering 
coefficient that suits the analysis of rating networks. Section describes the analysed 
datasets and defines the term popularity in the context of rating networks. In Section]^ 
we use our clustering coefficient as a tool to predict future popularity of new items, and 
present the results. Finally, we give conclusions and future work. 


2. The Clustering Coefficient 

The clustering coefficient was originally defined for one-mode networks and, globally. 


measures the concentration of triangles (Newman 2010). A bipartite network does not 


contain any cycles of odd length and hence is triangle free (Asratian et al. 1998). 


In 

Liebig & Rao ( 

2014) 

we looked at previous definitions of the bipartite clustering 

coefficient (Lind et al 

2005 

Opsahl 2013 fRobins &: Alexander 2004 Zhang et al. 

2008) 

showing that it is important not only to consider triadic closure ( 

Opsahl 2013) 

but to 


also distinguish between differently structured clusters. A cluster is a closed connection 
between three nodes of the same type and a 6-cycle is the only structure that could be 


considered a bipartite cluster (Opsahl, 2013). 


Opsahl (2013) calculates the bipartite clustering coefficient as follows: 

number of closed 4-paths 


C* = 


number of 4-paths 


( 1 ) 


where a 4-path is a possible 6-cycle and a closed 4-path is a 6-cycle. Suppose 
there exists a 4-path P = {vQ,wi,V 2 ,w^,Vi\ at time ti. At time U+i a node is 
added to the network and connected to nodes Vq and V 4 , forming the 6-cycle C® = 
{vo,Wi,V2,W3,V4,W5,Vo}. 

A 6-cycle in a bipartite network may have a maximum of three chords, resulting in 
four different types of cycles of length 6, see Fig. A chord is an edge that connects two 
nodes of a cycle but is not itself part of the cycle. A cycle without chords is called an 
induced cycle ( |Diestel[ 2005). Opsahl (2013) does not distinguish between the different 
structures, leading to an over count of 6-cycles (Liebig & Rao 2014). 

Further, Eq. ([^ indirectly assumes that the network in question is formed in a certain 
way: At each time step ti one secondary node is added to the network and connected 
to an arbitrary number of primary nodes. In reality, only some networks develop in 
this manner, for instance, a network that models the attendance of people at events. 
In such a network, at each time step ti an event is added to the network and people 
make connections to this new event at the same time by attending it. Since a person 
can attend an event only at the time it takes place, people can form a connection to a 
particular event only at a particular point in time. We studied such types of networks 
in ( [Liebig fc Rao[[^0I4| ). 
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(c) A 6-cycle with two chords. (d) A 6-cycle with three chords. 

Figure 1: There are four different possible types of 6-cycles in a bipartite network: (a) 
an induced 6-cycle, (b) a 6-cycle with one chord, (c) a 6-cycle with two chords and (d) 
a 6-cycle with three chords. Chords are represented by dashed lines. 


Rating networks, on the other hand, develop very differently. Users can form con¬ 
nections to items at any point in time. In other words, at each time step U one edge is 
added to the network and the assumption made by Eq. 0 is violated, as there is no 
necessity for a secondary node to be added at the same time, in order for a 6-cycle to be 
formed. Rating networks build a very different type of bipartite network and need to be 
analysed differently. 

To find a clustering coefficient that agrees with the development, over time, of rating 
networks, we examine the formation of the different 6-cycles, see Fig. [l] in rating net¬ 
works. In the following we make use of induced subgraphs. A subgraph of a network is 
induced if it contains all the edges that connect the nodes of the subgraph in the original 
network ( Diestel[ |2005[ ). The different ways in which 6-cycles in rating networks may be 
formed are depicted in Fig. 

Equations 0-0 give four novel clustering coefficients icc{k)^ one for each type of 
6-cycle, that suit the analysis of rating networks. 

The induced clustering coefficient is given by: 


6(71 


*CC(o) 


( 0 ) 


6(7(0) -I- K(o) 


( 2 ) 


where (7(o) is the number of induced 6-cycles and «:(o) is the number of unconnected 

5- paths. The induced clustering coefficient icc(o) measures the proportion of induced 

6- cycles to all unconnected paths of length 5 (Fig. 2aI. The number of induced 6-cycles 


is multiplied by 6, as each induced 6-cycle contains six unconnected 5-paths. 
The one chord clustering coefficient is given by: 


7(7, 


ZCC(I) 


( 1 ) 


7(7(1) -b (7, 


( 0 ) 


^( 1 ) 


( 3 ) 
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(a) We call the origin of an induced 
6-cycle an unconnected 5-path. 



(c) A 6-cycle with one chord may form 
a 6-cycle with two chords at the next 
time step. We call a 5-path that 
contains two extra edges between its 
nodes that do not belong to the path, 
a completely connected 5-path. Any of 
the two different completely connected 
5-paths above may also form a 6-cycle 
with two chords. 



(b) An induced 6-cycle may form a 6- 
cycle with one chord at the next time 
step. We call a 5-path that contains 
an extra edge between two of its nodes 
that does not belong to the path, a 
connected 5-path. Any of the three 
different connected 5-paths above may 
also form a 6-cycle with one chord. 



(d) A 6-cycle with two chords may 
form a 6-cycle with three chords at the 
next time step. 


Figure 2: All possibilities by which the different 6-cycles may be formed in a rating 
network. 


where cr(i) is the number of induced 6-cycles with one chord and K(i) is the number 
of connected 5-paths. The one chord clustering coefficient i cc(i ) measures the proportion 
of 6-cycles with one chord with respect to its origins (Fig 2bI. The number of 6-cycles 


with one chord is multiplied by 7, as each contains two of each of the connected 5-paths, 
shown in Fig. |2b[ and one 6-cycle without any chords. 

The two chord clustering coefficient is given by: 


4(7, 


ZCC(2) 


( 2 ) 


4ar 


(4) 


7(2) -I- (7(1) + K(2) 

where (7(2) is the number of induced 6-cycles with two chords and K( 2 ) is the number 
of completely connected 5-paths. The two chord clustering coefficient icc( 2 ) measures the 
proportion of 6-cycles with two chords with respect to its origins (Fig. 2cI. The number 
of 6-cycles with two chords is multiplied by 4, as it contains one of each of the completely 
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connected 5-paths, shown in Fig. and two 6-cycles with one chord. 
The three chord clustering coefficient is given by: 


3(71 


(3) 


( 5 ) 


*CC(3) = — 

3o’(3) + cr('2) 

where (7(3) is the number of induced 6-cycles with three chords. The three chord 
clustering coefficient icc( 3 ) measures the proportion of 6-cycles with respect to its origin 
(Fig. 2d). The number of 6-cycles with three chords is multiplied by 3, as it contains 


three 6-cycles with two chords. 

In a one-mode network, the local clustering coefficient of a node Vi is calculated by 
dividing the number of closed 2-paths that are centred at node Vi by the total number of 
2-paths that are centred at node Vi. In rating networks, most clusters are formed from 
5-paths (see Fig. [^. As a path of odd length can never be centred at a node, we consider 
all paths that involve node Vi in order to calculate u^’s clustering coefficient. The local 
clustering coefficients are denoted *cc(i;,.). 


3. The Data 

To test our prediction method, we have chosen two different datasets. 
3.1. The MovieLens Dataset 


The MovieLens data (GoupLens 20141 was collected by the University of Minnesota 


and contains 10,000,054 movie ratings that range between 1 and 5, with 5 being the best 
possible rating. Starting in January 1995, 71,567 different users rated 10,681 movies over 
a period of 14 years. Every user was assigned a unique id but no additional information 
about the users is known. A network is formed taking the users as the primary nodes 
and the movies as the secondary nodes. Each rating of a movie by a user is represented 
by an edge that links the user to the movie. Every edge is associated with a time stamp 
that corresponds to the time the rating was made. 

3.2. The Digg Dataset 

Digg (http://digg.com/) is a website that presents news stories and allows users to 
vote for them. This bipartite network contains 3,018,197 votes cast by 139,409 users. A 
total of 3,553 stories were rated over a period of one month in 2009. Unlike the MovieLens 
network, edges are not associated with a rating. An edge between a user and a story 


indicates that the user liked the story. The data was obtained from KONECT (2014). 


4. Predicting Popularity 

In this paper, we predict the popularity of newly released items using a novel ap¬ 
proach. Eirstly, we directly analyse the bipartite network, without using its one-mode 


projection in order to overcome information loss (Conaldi et al. 2012 Vogt & Mestres 


2010 Zhou et al. 2007). Secondly, we use our newly defined clustering coefficient as a 


tool to make accurate item popularity predictions. 
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4-1- What does popularity mean? 

Before making predictions about an item’s future popularity, it is important to clearly 
define the meaning of popular and unpopular. 

In previous research, (Zeng et al. 2013), an item was considered popular if its degree 
increased rapidly over a certain period of time. Intuitively, a popular movie, for example, 
is watched and rated more often than an unpopular movie, however, there are excep¬ 
tions. For instance, some movies may receive a relatively high number of low ratings and 
certainly should not be considered as popular. 

1999 [de Solla 


In contrast to the preferential attachment model (Barabasi & Albert 


Price, 1976) that predicts that nodes with a high degree are more likely to increase their 


degree than nodes with a low degree, in rating networks it is generally the case that the 
interest in an item, such as a movie, decays over time (Medo et al. 2011). The MovieLens 
data shows that, although movies are rated over a longer period of time, in many cases 
the ratings made within one month of the movie’s release determine its final average 
rating, see Fig. SI in the supplementary material. In case of the Digg dataset an item is 
frequently rated within 48 hours after the initial rating. After this period the interest in 
the item decays very quickly, see Fig. S2 in the supplementary material. Hence there is 
a need for making good early predictions. 

We calculate the average number of ratings that items receive during this critical 
period. We define the critical period as the time period that most affects the average 
rating of an item. In case of the Digg network, where edges are not associated with a 
rating, the critical period is the time span in which stories are most frequently rated, i.e. 
within 48 hours of the initial rating. An item is considered as popular if it receives a 
higher number of ratings than the average item and obtains a high average rating. In the 
MovieLens dataset for instance, a movie received 29 ratings on average during the first 
month. Hence a movie is considered as popular if it receives 29 ratings or more during 
the first month and obtains an average rating greater or equal to 4. 

We calculate a popularity score, p, for each item based on the number of ratings 
received during the critical period and the average rating at the end of the critical period. 
In order to achieve a score that ranges between 0 and I, a logistic function is used. A 
logistic function is an s-shaped curve that is frequently used to model population growth 
(Pearl & Reed 1920). The function grows exponentially first with the slope decreasing 
thereafter until the function reaches a steady state. The popularity score p is given by: 


pip.,n) 


1 

1 _(_ g-k(im-c) ’ 


( 6 ) 


where p, is the average rating, n is the number of ratings received within the critical 
period and c and k are constants. The constant c is chosen as follows: In the MovieLens 
dataset for instance, a movie is considered as popular if it received 29 ratings or more 
and obtained an average score of 4 or higher. The constant c is chosen such that a movie 
that received exactly 29 ratings and an average score of 4 receives a popularity score 
of p = 0.5. The constant k is chosen such that an item without any ratings receives 
a popularity score of approximately equal to 0. See Section S2 in the supplementary 
material for calculations of c and k for the different datasets. Figure shows the change 
in the shape of p{p, n) as the average rating p changes in the case of the MovieLens 
dataset. Figure S3 in the supplementary material shows a plot of p{p,n) in the form of 
a heat map to give a clear visualisation. 
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Figure 3: We plotted Eq. ®> for five different values of fj,, in order to show the corre¬ 
sponding change in the shape of the logistic curve. Clearly, for a low average rating /r a 
large number of ratings is necessary to achieve a high popularity score. 


Some rating systems may not give their users the opportunity to rate an item on 
a scale. One example is the Digg network, where an edge between a user and a story 
indicates that the user liked the story. To be able to calculate the popularity score p for 
items in such networks, we assign a rating of 5 to each edge. 

In case of the MovieLens network, we expect that the average rating of an item is to 
some degree related to the number of ratings it received. However, Fig. shows that 
just using the number of ratings to predict the average rating results in a large number 
of items (see lower right quadrant of the plot) being wrongly identified as popular. 

Since new items are hard to classify as popular or unpopular, we examine the user 
who is the first to rate the new item. We start by extracting the ego network of the user 
who first rated a new item, to depth three. In other words, we include all first, second 
and third neighbours of the ego and only allow edges corresponding to ratings made 
during a certain period of time prior to the first rating of the new item. Analysis of user 
activity showed that in the case of the MovieLens network, ratings made up to ten days 
prior to the first rating of the new movie have to be included. Any period less than ten 
days, in most cases, resulted in an ego network that only contained a single edge. Since 
the dynamics in the Digg network is much faster, a period of six hours prior to the first 
rating is sufficient. Since we consider the ego network of the first user, we henceforth 
refer to this user as the ego. A depth of three is necessary to be able to calculate the 
local clustering coefficients of the ego. 

Since the popularity score p of an item is dependent on both the number of ratings 
received as well as the average rating, the two parameters are separately predicted. The 
popularity scores thus obtained are compared to the actual popularity scores calculated 
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MovieLens 



number of ratings 


Figure 4: We plotted the movies’ average ratings /x against the number of ratings n they 
received, to examine their relationship. The correlation coefficient between the average 
rating and the number of ratings is low (0.183). If we were to consider only the number 
of received ratings to determine the movie’s popularity, many would be wrongly classified 
as popular, see lower right quadrant of the plot. All movies in the lower right quadrant 
received a relatively high number of ratings but have an average rating of less than 4. 


from the real data. 

4-2. Predicting the number of ratings 

As will be demonstrated below, both the ego’s degree, i.e. the ego’s rating activity, 
as well as the number of its second neighbours perform poorly as predictors, whereas, 
the ego’s clustering behaviour is a better predictor of the number of ratings that the new 
item will receive. 


4-2.1. The Ego’s Rating Activity 

In the context of one-mode networks, it has been argued many times that a node 
with high degree is not necessarily influential (e.g. Li et ah, 2014). This is also the case 
in the MovieLens and Digg networks. In other words, the rating of a new item by a 
highly active user does not imply that the item will receive many ratings. Figure S4 in 
the supplementary material demonstrates this. 


4-2.2. Second Neighbours of the Ego 

Since rating networks are bipartite, it would be more apt to consider the number of 
second neighbours of an ego as a predictor of an item’s number of ratings instead of the 
ego’s degree, as the latter only gives the the number of items rated by the ego. Second 
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neighbours of the ego are users who rated at least one item that was also rated by the 
ego. 

In addition, users who rated the same items as the ego in the recent past are more 
likely to rate the new item than a randomly selected user. Hence, the number of second 
neighbours of the ego, may give some indication of the number of ratings that the new 
item will receive in the near future. 

However, as depicted in Fig. S5 in the supplementary material, the number of second 
neighbours is also a poor indicator for the item’s future degree. 

4-2.3. The ego’s clustering behaviour 

We now examine the clustering behaviour of users who were the first to rate a new 
item. Table [l] shows that the extracted ego networks vary considerably. 




MovieLens 

Digg 


range 

[3, 4411] 

[25, 11769] 

size 

mean 

1681 

1521 


sd 

1060 

1504 


range 

[2, 540] 

[2, 38] 

mean degree 

mean 

82 

8 


sd 

69 

4 


range 

[0.0098, 1] 

[0.0038, 0.5217] 

density 

mean 

0.1065 

0.0397 


sd 

0.1793 

0.0477 


Table 1: The table shows the range, mean and standard deviation of the size, average 
degree and density of the extracted ego networks. 


To determine the ego’s clustering behaviour, we calculate the four different local 
clustering coefficients that we introduced in Section For the majority of the ego 
networks, the induced clustering coefficient is the highest, whereas in most cases the 
three other coefficients are 0. Clustering in all of the ego networks is low due to their 
scale-free nature. 

The four different clustering coefficients measure the proportions of induced 6-cycles, 
6-cycles with one chord, 6-cycles with two chords and 6-cycles with three chords respec¬ 
tively. In the case of one-mode networks Ugander et al. (20121 have shown that a node 
is less likely to be infected if its neighbourhood is well connected. Hence, we expect that 
when an ego’s clustering coefficient is lower than the average clustering coefficient in its 
ego network, then the new item will receive a high number of ratings in the near future. 


Chen et al. (2013b I have also shown that nodes with a low local clustering coefficient 


attract more connections. 

We compare the ego’s clustering behaviour to all other users in its ego network by 
calculating how many standard deviations it lies away from the average local clustering 
coefficient over all users in the ego network. We expect that a high difference in standard 
deviations indicates that the new item will receive many ratings in the future, provided 
the ego’s clustering coefficient is lower than the average. If on the other hand, the ego’s 
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clustering coefficient is higher than the average and the difference in standard deviations 
is high, we expect the new item to receive very few ratings in the future. 

Since the three chord clustering coefficient, icc( 3 ) shows higher connectivity than 
the induced clustering coefficient icc(o), we give the differences in standard deviations 
appropriate weights, according to their level of connectivity. The first rating of the new 
item is also taken into account, since it is likely to influence other users. For instance, if 
the first rating of a new item is low, it is less likely to receive many ratings than if the 
first rating is high. 

The following equation gives the predicted number of ratings n if the ego’s clustering 
coefficients are lower than the average: 


r 


n = -(2AicC(ego.O) + 3AicC(ego,l) + 4AicC(ego,2) + 5AicC(ego.3)) 


(7) 


where r is the first rating of the new item that was given by the ego and Aicc(^ego,k) 
is the difference in standard deviations between that particular clustering coefhcient of 
the ego and the average of the same clustering coefficient in the ego’s network. Since 
ratings range between 1 and 5, the initial rating r is divided by 3. Hence, a rating of 3 
is treated as neutral. 

If, on the other hand, one or more of the ego’s clustering coefficients are higher than 
the average, then we divide by the corresponding weight instead of multiplying. For 
example, if in a given ego network, icc(^ego,o) icc(^ego,i) are lower that the correspond¬ 
ing average clustering coefhcient and the other two local clustering coefhcients icC(^ego, 2 ) 
and zcc(ego. 3 ) are higher, then Eq. Q becomes: n = |(2Aicc(ego,o) + 3A*cc(ego.i) + 
AmC(ego,2)/4 -b AzCC(ego,3)/5). 

The reason for only using the hrst reviewer of an item to predict its popularity is 
that our aim is to make predictions as early as possible. Considering a combination of 
the Hrst few ratings may improve predictions, however, it comes with the disadvantage 
of having to make the predictions later in time. Whether the improvement of predictions 
is high enough to sacrihce early predictions will be answered in future work. 

Figure S6 in the supplementary material shows that the number of ratings was cor¬ 
rectly predicted for 53% of movies in the MovieLens network and 67% of stories in the 
Digg network. We took a closer look at the movies where the number of ratings was 
incorrectly predicted. Generally, the movies that received a higher number of ratings 
than predicted, received very mixed reviews by critics. We have listed some of these 
movies in Table SI in the supplementary material. Among the movies that received 
less ratings than predicted, are non-English Hlms. Usually these movies received good 
critique in their country of origin. We give a more detailed discussion in Section S4 of 
the supplementary material. 

4-3. Predicting the Average Rating 

We again make use of the logistic curve to estimate the average rating of items. Since 
ratings range between 1 and 5, the function values should also range between these values 
and hence 
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(8) 
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where c and k are constants. The constants are chosen such that f{n) « 1 if an 
item has a predicted number of ratings equal to zero and f{n) = 4 if an item has 
a predicted number of ratings equal to the number of ratings that the average item 
received. Calculations are shown in the supplementary material. 

Using Eq. ([^ together with the first rating r and the predicted number of ratings n, 
we can estimate the future average rating, fi, of the new movie: 

f^ = {f{n) +r)/2. (9) 

Equation Q predicts the future average rating well. However, the rating that is 
given by the first user has a large influence on other users. Therefore, we take the 
average between /(h) and the first rating to improve prediction of the future average 
rating of an item. 

With the two parameters, fl and h, the popularity score can now be predicted for 
each item. 


MovieLens Digg 




Figure 5: We plotted the actual popularity of items in the MovieLens (left) and Digg 
(right) networks as a function of the predicted popularity. For the MovieLens network 
we were able to correctly predict the future popularity of 66% of the movies. For the 
Digg network we achieved a success rate of 51%. 


We have predicted the popularity of all movies that were released between 2004 
and 2007 in the MovieLens network and 350 randomly chosen news stories in the Digg 
network. Figure compares the predicted popularity scores p (Eq. ®) to the actual 
popularity scores p and shows that our method correctly predicted the popularity of 638 
of the 962 movies in the MovieLens network, with the difference between the predicted 
popularity p and the actual popularity p being less than 0.05. This is a success rate 
of approximately 66%. This is especially good since no use was made of any other 
information about the new movie other than the first rating. In previous research, 
authors were only able to predict the popularity of a new movie with 30% accuracy 
(Zeng et al. 20131. In the Digg network we achieved a success rate of approximately 


51%, where we were able to correctly predict the popularity of 179 out of 350 news 
stories, compared to a 20% success rate achieved in (Zeng et al., 2013). 
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5. Conclusion 


In this paper, we introduced a new clustering coefficient that is suitable for the anal¬ 
ysis of rating networks as well as other bipartite networks that form in a similar manner. 
Since our clustering coefficient distinguishes between differently structured clusters it 
gives valuable topological information about the network. 

Further, we showed that it may be used as a tool to predict the future popularity of 
items in rating networks. We focused on improving predictions of new items that are 
hard to classify as popular or unpopular due to lack of information. Previous research 
of one-mode networks has shown that nodes with a low clustering coefficient attract 
connections in the future (Chen et ah, 2013aI. These nodes are also known to spread 


information faster than nodes with a higher clustering coefficient. Ugander et al. (2012) 
have also shown that a node is less likely to be infected if its neighbourhood is well 
connected. We consider the clustering coefficient as a predictor of popularity based on 
this previous research. Since a new item, at the time of prediction, has degree one, we 
examine the clustering coefficient of the user who rated the item first. If this user has a 
low clustering coefficient compared to the users in its neighbourhood, the missing edges 
may be added by building connections to the new item. 

By investigation of the clustering behaviour of the user who made the first rating of 
a new item we correctly predicted the future popularity of over 65% of new movies in 
the MovieLens dataset and over 50% of new stories in the Digg dataset. This is a major 
improvement on previous research. 

One limitation of the clustering coefficient is that computation time grows expo¬ 
nentially with the size of the network. Thus, a topic for future research would be the 
improvement of algorithms that count the different 5-paths and 6-cycles. 

Another very interesting topic of research is the identification of the, for instance 100, 
most popular items in the future. We aim to address this problem in future work. 
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Appendix A. Supplementary Figures 


Wedding Date, The 



Constantine 



Aliens of the Deep Boogeyman 




Inside Deep Throat 



Hitch 



Pooh’s Heffalump Movie 



Because of Winn-Dixie Oldboy 




Figure A.6: In the MovieLens network it is often the case that the average rating of a 
movie fluctuates during the first month after the initial rating, before becoming steady. 
Here, we have plotted nine examples. The cc-axis shows the time in form of the date and 
the y-axis displays the corresponding average rating. 
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Figure A.7: In the Digg network the user interest in an item decays very quickly. Here 
we plotted the differences in time between the ratings of 16 different stories. The y-axis 
shows the difference in hours to the previous rating and the x-axis shows the time in 
hours after the first rating. Most items are frequently rated during the first 48 hours 
after the initial rating. 
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Figure A.8: The heat map shows values of the popularity score p (Eq. (6)) with respect to 
the number of ratings and the average rating for the MovieLens network. Red corresponds 
to p = 1 and blue to p « 0. The heat map shows that a high value of p can only be 
achieved if the number of ratings as well as the average rating is high. The upper left 
corner of the heat map is also dark red, however, an item with a low average rating 
usually does not gain the requisite large number of ratings in order to receive a high 
popularity score. 
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Figure A.9: In order to demonstrate that the ego’s degree is not a good indicator for how 
many ratings the new item will receive, we have plotted the new items’ degrees (y-axis) 
after the critical period against the corresponding ego’s degree (x-axis). The hrst four 
plots correspond to the MovieLens network, the last four plots correspond to the Digg 
network. We considered ego degrees one, two, three and four weeks prior to the first 
rating in case of MovieLens. For the Digg network we considered ego degrees six hours, 
twelve hours, one day and two days prior to the hrst rating. In all cases no correlation 
is present (see title of each plot). 
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Figure A.10: In order to demonstrate that the ego’s number of second neighbours is not 
a good indicator for how many ratings the new item will receive, we have plotted the 
new items’ degrees (y-axis) after the critical period against the corresponding number of 
second neighbours of the ego (a;-axis). The first four plots correspond to the MovieLens 
network, the last four plots correspond to the Digg network. We considered ego degrees 
one, two, three and four weeks prior to the first rating in case of MovieLens. For the 
Digg network we considered ego degrees six hours, twelve hours, one day and two days 
prior to the first rating. In all cases no correlation is present (see title of each plot). 
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predicted numeber of ratings predicted numeber of ratings 


Figure A.11: We plotted the actual number of received ratings, n, as a function of the 
predicted number of ratings, n. For the MovieLens network (left), we correctly predicted 
the number of ratings for 53% of the movies. These movies are represented by dots 
within the two straight lines. For the Digg network (right), we correctly predicted the 
number of ratings for 57% of news stories. We give a detailed analysis of the movies in 
the MovieLens network where the number of ratings was incorrectly predicted in Section 
Appendix D 
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Appendix B. Calculation of the Constants in Eq. (6) 

The popularity score p{fi,n) is given by: p{p,n) = 1/(1 + see Eq. (6), 

where p is the average rating, n is the number of ratings received within the critical 
period and c and k are constants. 

The constants c and k are chosen such that an item that received as many rating as 
the average item and an average rating of 4 receives a popularity score of 0.5. An item 
without any ratings should receive a popularity score of approximately equal to 0. 

Hence, in case of the MovieLens network: 

0.5 = l/(l + e-'=(‘‘*29-^)) 

0.5(1+ = 1 

g-fc(116-c) _ ^ 

-fc(116-c) = 0 

c = 116 since k ^ 0 

and 

0.5-10"^ = l/(l + e"'=(°"^^®)) 

0.5-10-3(1+ = 1 

gll6fc ^ ^ ggg 

116A: = 7.6 
k = 0.066. 

Therefore p{p,n) = 1/(1 + g- 0 - 066 {fm-ii 6 )-^ MovieLens network. 

In the Digg network a story received on average six ratings during the Hrst two days. 
Since we associated every edge in the Digg network with a rating of 5, a news story that 
received six ratings and an average rating of 5 should receive a popularity score of 0.5. 
Hence, c = 30, fc = 0.253 and therefore p{p,n) = 1/(1 + e-°-^33(M"-30)^ £qj. ^j^g Djgg 
network. 

Appendix C. Calculation of the Constants in Eq. (8) 

The predicted average of a new item is given by f{n) = 1 + 4/(1 + where c 

and k are constants, see Eq. (8). The constants are chosen such that /(n) « 1 if an item 
has a predicted number of ratings equal to zero and f(n) = 4 if an item has a predicted 
number of ratings equal to the number of ratings that the average item received. In the 
MovieLens network the average item received 29 ratings and hence, 

0.5-10-3 = 4/(l + e-'=(°-^)) 

0.5-10-3(1 + 6^'=) = 4 

6=='= = 7999 
ck = 8.987 
c = 8.987/fc 
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and 


3 

3 (l_^g-fc( 29 -c)) 

g-fe(29-c) 

—29k + ck 
29k 
k 


4 

1/3 

-1.099 

10.086 

0.348. 


Therefore, f{n) = 1 + 4/(1 -|-£-0-348(^-25.825)^ MovieLens network. In case of 

the Digg network, the average rating does not need to be predicted, since we associated 
every edge with a rating of 5. 


Appendix D. Analysis of Movies 

We have predicted the popularity of all movies that were released between 2004 and 
2007 in the MovieLens network. Our method correctly predicted the number of ratings of 
510 of the 962 movies. This is a success rate of approximately 53%. Here we take a close 
look at the movies where the number of ratings was incorrectly predicted. We divide this 
set of movies into two subsets, one containing movies where the actual number of ratings 
is higher than predicted and the other containing movies where the actual number of 
ratings is lower than predicted. 

Amongst the set of movies where the actual number of ratings was lower than 
predicted, approximately 30% of movies are in languages other than English. These 
movies generally received very positive reviews by film critics. Hence our predictions 
overall agree with Him critics and the low actual number of ratings in the MovieLens 
dataset may be explained by the high number of English speaking users. Table |D.2| 
lists some of these movies together with the number of ratings that were collected 
by the websites Rotten Tomatoes (http://www.rottentomatoes.com/) and Metacritic 
(http://www.metacritic.com/). In addition, for the website Rotten Tomatoes the table 
displays the tomatometer score that represents the percentage of approved critics that 
have given the movie a positive review. For Metacritic we also show the metascore. The 
metascore ranges between 0 and 100, with 100 being the best possible score. 


Movie title 

actual 
number 
of ratings 
(average) 

predicted 
number 
of ratings 
(pre¬ 
dicted 
average) 

Rotten 

Tomatoes 

(average) 

Metacritic 

(average) 

Red Lights (Eeux rouges) (2004) 
Lost Embrace (El Abrazo Par- 

3 (2.88) 

2 (3) 

19 (2.17) 

17 (1.59) 

86 (83%) 

48 (83%) 

28 (74) 

23 (70) 


tido) (2004) 
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Movie title 

actual 
number 
of ratings 
(average) 

predicted 
number 
of ratings 
(pre¬ 
dicted 
average) 

Rotten 

Tomatoes 

(average) 

Metacritic 

(average) 

Sea Inside, The (Mar adentro) 
(2004) 

7 (4.11) 

19 (2.66) 

131 (84%) 

38 (74) 

Machuca (2004) 

7 (2.28) 

21 (3.08) 

33 (89%) 

37 (76) 

Tae Guk Gi - The Brotherhood 
of War (Taegukgi hwinalrimyeo) 
(2004) 

5 (4.06) 

37 (4.22) 

41 (80%) 

19 (64) 

Appleseed (Appurushldo) (2004) 

7 (4.21) 

22 (2.91) 

32 (25%) 

17 (40) 

Turtles Gan Fly (Lakposhtha 
ham parvaz mikonand) (2004) 

2 (3.5) 

12 (2.02) 

72 (88%) 

31 (85) 

Loop the Loop (a.k.a. Up and 
Down) (Horem padem) (2004) 

2 (4.25) 

40 (4.49) 

65 (83%) 

27 (78) 

Walk on Water (2004) 

3 (4.25) 

18 (3.13) 

75 (72%) 

28 (65) 

Look at Me (Gomme une image) 
(2004) 

5 (3.31) 

27 (4.17) 

98 (87%) 

30 (79) 

Year of the Yao, The (2004) 

1 (4) 

17 (2.6) 

33 (67%) 

11 (62) 

Three... Extremes (Saam gaang 
yi) (2004) 

8 (3.84) 

34 (4.38) 

62 (84%) 

22 (66) 

Bittersweet Life, A (Dalkomhan 
insaeng) (2005) 

4 (3.25) 

17 (2.35) 

10 (100%) 

na 

Duck Season (Temporada de 
patos) (2004) 

5 (3.38) 

19 (2.92) 

73 (90%) 

27 (74) 

Usphizin (2004) 

7 (3.91) 

17 (2.85) 

61 (93%) 

na 

Vinci (2004) 

2 (3.5) 

29 (3.97) 

na 

na 

Business, The (2005) 

3 (3.63) 

20 (2.48) 

na 

na 

Tony Takitani (2004) 

3 (3.38) 

22 (2.65) 

52 (88%) 

22 (88) 

Ghild, The (L’Enfant) (2005) 

6 (3.94) 

20 (3.26) 

na 

34 (87) 

Hidden Blade, The (Kakushi ken 
oni no tsume) (2004) 

4 (3.75) 

29 (4.01) 

31 (87%) 

11 (76) 

Three Times (Zui Hao De Shi 
Guang) (2005) 

4 (3.06) 

19 (2.68) 

50 (86%) 

22 (80) 

Taxidermia (2006) 

5 (4.16) 

19 (2.4) 

46 (80%) 

9 (83) 

Gui Si (Silk) (2006) 

1 (3.5) 

12 (2.26) 

5 (40%) 

na 

Arn - The Knight Templar (Arn 
- Tempelriddaren) (2007) 

3 (3.5) 

16 (2.56) 

na 

na 

Tell No One (Ne le dis a per¬ 
sonae) (2007) 

8 (3.71) 

34 (4.88) 

108 (94%) 

30 (82) 

Gzech Dream (Cesky sen) (2004) 

2 (3.75) 

19 (2.4) 

24 (79%) 

7 (72) 

4 Months, 3 Weeks and 2 Days (4 
luni, 3 saptamani §i 2 zile) (2007) 

35 (3.82) 

46 (4.5) 

133 (95%) 

37 (97) 

Om Shanti Om (2007) 

3 (3.88) 

25 (2.94) 

13 (77%) 

na 
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Movie title 

actual 
number 
of ratings 
(average) 

predicted 
number 
of ratings 
(pre¬ 
dicted 
average) 

Rotten 

Tomatoes 

(average) 

Metacritic 

(average) 

Aerial, The (La Antena) (2007) 

3 (4) 

48 (4.99) 

11 (64%) 

na 

Inside (A I’interieur) (2007) 

5 (3.13) 

17 (2.35) 

12 (83%) 

na 

Unknown Solider, The (Un- 
bekannte Soldat, Der) (2006) 

1 (3) 

13 (2.02) 

10 (60%) 

6 (71) 

Aleksandra (2007) 

1 (3) 

13 (2.02) 

na 

13 (85) 

Ganes (2007) 

2 (3) 

20 (2.22) 

na 

na 

Katyn (2007) 

4 (3.63) 

21 (1.8) 

64 (94%) 

17 (81) 

Maria Full of Grace (Maria, 
Llena eres de gracia) (2004) 

7(4) 

19 (2.44) 

139 (97%) 

39 (87) 

Veer Zaara (2004) 

2 (3.25) 

40 (4.99) 

na 

5 (67) 

Bad Education (La Mala edu- 
cacion) (2004) 

15 (3.79) 

30 (4.58) 

137 (88%) 

34 (81) 


Table D.2: The table lists the non-English movies that were predicted to receive a higher 
than the actual number of ratings. In general, these movies received very positive reviews 
from critics. The low number of ratings received by MovieLens users may be due to 
their demographics. We listed the number of ratings that were recorded by the websites 
Rotten Tomatoes and Metacritic. In addition, for the website Rotten Tomatoes the table 
displays the tomatometer score that represents the percentage of approved critics that 
have given the movie a positive review. For Metacritic we also show the metascore. The 
metascore ranges between 0 and 100, with 100 being the best possible score. 


Among the movies that our method predicted to receive a lower number of ratings 
than in reality are many that received very mixed or negative reviews from other websites. 
Since we do not have any information about the MovieLens users, we are unable to 
explain these results. It may be possible that in these cases, the user who first rated the 
movie usually does not watch movies in that particular genre. Another reason may be 
that these movies were highly anticipated and therefore received many ratings, although 
scores were generally low. 
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